481 research outputs found
From Micro to Macro: Uncovering and Predicting Information Cascading Process with Behavioral Dynamics
Cascades are ubiquitous in various network environments. How to predict these
cascades is highly nontrivial in several vital applications, such as viral
marketing, epidemic prevention and traffic management. Most previous works
mainly focus on predicting the final cascade sizes. As cascades are typical
dynamic processes, it is always interesting and important to predict the
cascade size at any time, or predict the time when a cascade will reach a
certain size (e.g. an threshold for outbreak). In this paper, we unify all
these tasks into a fundamental problem: cascading process prediction. That is,
given the early stage of a cascade, how to predict its cumulative cascade size
of any later time? For such a challenging problem, how to understand the micro
mechanism that drives and generates the macro phenomenons (i.e. cascading
proceese) is essential. Here we introduce behavioral dynamics as the micro
mechanism to describe the dynamic process of a node's neighbors get infected by
a cascade after this node get infected (i.e. one-hop subcascades). Through
data-driven analysis, we find out the common principles and patterns lying in
behavioral dynamics and propose a novel Networked Weibull Regression model for
behavioral dynamics modeling. After that we propose a novel method for
predicting cascading processes by effectively aggregating behavioral dynamics,
and propose a scalable solution to approximate the cascading process with a
theoretical guarantee. We extensively evaluate the proposed method on a large
scale social network dataset. The results demonstrate that the proposed method
can significantly outperform other state-of-the-art baselines in multiple tasks
including cascade size prediction, outbreak time prediction and cascading
process prediction.Comment: 10 pages, 11 figure
Crowdsourcing Argumentation Structures in Chinese Hotel Reviews
Argumentation mining aims at automatically extracting the premises-claim
discourse structures in natural language texts. There is a great demand for
argumentation corpora for customer reviews. However, due to the controversial
nature of the argumentation annotation task, there exist very few large-scale
argumentation corpora for customer reviews. In this work, we novelly use the
crowdsourcing technique to collect argumentation annotations in Chinese hotel
reviews. As the first Chinese argumentation dataset, our corpus includes 4814
argument component annotations and 411 argument relation annotations, and its
annotations qualities are comparable to some widely used argumentation corpora
in other languages.Comment: 6 pages,3 figures,This article has been submitted to "The 2017 IEEE
International Conference on Systems, Man, and Cybernetics (SMC2017)
Association between high serum blood glucose lymphocyte ratio and all-cause mortality in non-traumatic cerebral hemorrhage: a retrospective analysis of the MIMIC-IV database
BackgroundThis study aimed to evaluate the association between the glucose-to-lymphocyte ratio (GLR) and all-cause mortality in intensive care unit (ICU) patients with Non-traumatic cerebral hemorrhage.MethodsThis is a retrospective cohort study. Baseline data and in-hospital prognosis from patients with non-traumatic cerebral hemorrhage admitted to the intensive care unit. Multivariate COX regression analysis was applied and adjusted hazard ratios (HR) and 95% predictive values with confidence intervals (CI) were calculated. Survival curves for the two groups of cases were plotted using K-M curves, and subgroup analyses were performed in one step. Using restricted cubic spline curves, we analyzed the potential linear relationship between GLR and outcome indicators.ResultsIn the Medical Information Mart for Intensive Care IV (MIMIC-IV database), we extracted 3,783 patients with nontraumatic intracerebral hemorrhage, and 1,806 patients were finally enrolled in the study after exclusion of missing values and patients with a short hospital stay. The overall ICU mortality rate was 8.2% (148/1806) and the in-hospital mortality rate was 12.5% (225/1806). The use of curve fitting yielded a significant linear relationship between GLR and both ICU mortality and in-hospital mortality. It also suggested a reference point at GLR=3.9. These patients were categorized into high and low subgroups based on the median value of their GLR (GLR = 3.9). Model comparisons based on multivariate COX regression analysis showed that in-hospital mortality was higher in the high GLR group after adjusting for all confounders (HR = 1.31, 95% CI: 1.04-1.47), while the ICU mortality in the high GLR group was (HR = 1.73, 95% CI: 1.18-2.52). Stratified analyses based on age, gender, race, GCS, BMI, and disease type showed stable correlations between the high GLR group and in-hospital and ICU mortality.ConclusionBased on our retrospective analysis, it is known that as the GLR increased, the in-hospital mortality rate and ICU mortality rate of patients with nontraumatic cerebral hemorrhage also increased progressively in the United States in a clear linear relationship. However, further studies are needed to confirm these findings
Using Argument-based Features to Predict and Analyse Review Helpfulness
We study the helpful product reviews identification problem in this paper. We
observe that the evidence-conclusion discourse relations, also known as
arguments, often appear in product reviews, and we hypothesise that some
argument-based features, e.g. the percentage of argumentative sentences, the
evidences-conclusions ratios, are good indicators of helpful reviews. To
validate this hypothesis, we manually annotate arguments in 110 hotel reviews,
and investigate the effectiveness of several combinations of argument-based
features. Experiments suggest that, when being used together with the
argument-based features, the state-of-the-art baseline features can enjoy a
performance boost (in terms of F1) of 11.01\% in average.Comment: 6 pages, EMNLP201
Cross-Scale Cost Aggregation for Stereo Matching
Human beings process stereoscopic correspondence across multiple scales.
However, this bio-inspiration is ignored by state-of-the-art cost aggregation
methods for dense stereo correspondence. In this paper, a generic cross-scale
cost aggregation framework is proposed to allow multi-scale interaction in cost
aggregation. We firstly reformulate cost aggregation from a unified
optimization perspective and show that different cost aggregation methods
essentially differ in the choices of similarity kernels. Then, an inter-scale
regularizer is introduced into optimization and solving this new optimization
problem leads to the proposed framework. Since the regularization term is
independent of the similarity kernel, various cost aggregation methods can be
integrated into the proposed general framework. We show that the cross-scale
framework is important as it effectively and efficiently expands
state-of-the-art cost aggregation methods and leads to significant
improvements, when evaluated on Middlebury, KITTI and New Tsukuba datasets.Comment: To Appear in 2013 IEEE Conference on Computer Vision and Pattern
Recognition (CVPR). 2014 (poster, 29.88%
Using Argument-based Features to Predict and Analyse Review Helpfulness
We study the helpful product reviews identification problem in this paper. We
observe that the evidence-conclusion discourse relations, also known as
arguments, often appear in product reviews, and we hypothesise that some
argument-based features, e.g. the percentage of argumentative sentences, the
evidences-conclusions ratios, are good indicators of helpful reviews. To
validate this hypothesis, we manually annotate arguments in 110 hotel reviews,
and investigate the effectiveness of several combinations of argument-based
features. Experiments suggest that, when being used together with the
argument-based features, the state-of-the-art baseline features can enjoy a
performance boost (in terms of F1) of 11.01\% in average.Comment: 6 pages, EMNLP201
- …